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We analyze the hyperbolicity of real-world networks, a geometric quantity that measures if a space 
is negatively curved. In our interpretation, a network with small hyperbolicity is “aristocratic”, 
because it contains a small set of vertices involved in many shortest paths, so that few elements 
“connect” the systems, while a network with large hyperbolicity has a more “democratic” structure 
with a larger number of crucial elements. 

We prove mathematically the soundness of this interpretation, and we derive its consequences 
by analyzing a large dataset of real-world networks. We confirm and improve previous results on 
hyperbolicity, and we analyze them in the light of our interpretation. 

Moreover, we study (for the first time in our knowledge) the hyperbolicity of the neighborhood 
of a given vertex. This allows to define an “influence area” for the vertices in the graph. We show 
that the influence area of the highest degree vertex is small in what we define “local” networks, like 
most social or peer-to-peer networks. On the other hand, if the network is built in order to reach a 
“global” goal, as in metabolic networks or autonomous system networks, the influence area is much 
larger, and it can contain up to half the vertices in the graph. In conclusion, our newly introduced 
approach allows to distinguish the topology and the structure of various complex networks. 


I. INTRODUCTION 

The most basic way to describe a graph is to consider 
its metric quantities as for instance the diameter [5], the 
degrees, the distances [7]. In case no other information 
is available, a good choice is to consider randomly drawn 
edges ECU EE3 and determine the expected values and 
distribution of those properties in such random graphs. 
More recently, the computer revolution and the pervasive 
presence of Internet and WWW, created a whole series of 
complex networks in technological systems whose prop¬ 
erties can be directly measured from data [2]. All the 
real networks show particular structures of edges, mak¬ 
ing them definitely different from random graphs. Driven 
by such evidence, researchers recognized analogous struc¬ 
tures in a variety of other cases, ranging from biology 
to economics and finance [5]. All these structures show 
lack of characteristic scale in the statistical distribution 
of the degree and small world effect, making therefore 
important to understand the basic principles at the basis 
of their formation mmm- 

To bring order in this huge set of systems, it would 
be extremely useful if we could classify the various net¬ 
works by means of some specific quantity differing from 
case to case. In this quest of distinguishing universal 
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from particular behavior we decide to consider the con¬ 
nection with the “curvature” of the graph. Embedding 
spaces can have negative curvature (hyperbolic spaces), 
zero value of curvature (Euclidean spaces), or positive 
curvature (spherical spaces). On the bases of the hyper¬ 
bolicity measure m , it is possible to extend such a mea¬ 
sure of curvature for manifolds to discrete networks. Hy¬ 
perbolicity measure |16| defines the curvature for an infi¬ 
nite metric space with bounded local geometry, using a 4 
points condition. In detail, the hyperbolicity 6(x, y , v, w) 
of a 4-tuple of vertices {x,y,v,w} is defined as half the 
difference between the biggest two of the following sums: 

d(x,y)+d(v,w), d(x,v)+d(y,w), d(x,w)+d(y,v) (1) 

where d denotes the distance between two vertices. If 
the lengths of edges are integers (in our case, the length 
of each edge is assumed to be 1), the hyperbolicity of a 
4-tuple of vertices is always an integer or its half. The 
hyperbolicity 6(G) of a graph G is commonly defined as 
the maximum of the hyperbolicity of a 4-tuple of vertices 
m ® mi mm- However, for the purposes of this work, 
also the average hyperbolicity of a 4-tuple will have a 
significant role: to distinguish the two, we will use <5 wor st 
and <5avg, respectively [3j. 

This approach attracted the interest of the community, 
both in modeling this phenomenon |(U| . and in classifying 
networks from the real world pQ. For example, it has 
been argued that several properties of complex networks 
arise naturally, once a negative curvature of the space 
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has been assumed m- Similarly, others investigated the 
role of hyperbolicity in a series of different networks [3] 
ranging from social networks in dolphins, to characters in 
books, with the aim of discovering essential edges in path 
of communication. In addition, by studying structural 
holes, it has been shown that most of these networks are 
essentially tree-like [3|. 

Nevertheless, nobody provided an interpretation of 
what is measured by hyperbolicity, and in the literature 
there are few applications of this quantity. This paper 
wants to fill this gap, by considering hyperbolicity as a 
measure of how much a network is “democratic” (the 
larger the hyperbolicity value the larger the “democracy” 
in the network). Indeed, we prove mathematical results 
that link a small hyperbolicity constant with the exis¬ 
tence of a small set of vertices “controlling” many paths, 
and hence with a non-democratic network (implications 
are true in both directions). 

As far as we know, this is the first measure of democ¬ 
racy in a complex network, apart from assortativity 
ed mi- In any case, our measure is quite different 
from the latter one, because it is based on shortest paths 
and not on neighbors: consequently, the new measure is 
global. Moreover, it is more robust: for instance, if we 
“break” all edges by “adding a vertex in the middle”, the 
hyperbolicity of the graph does not change much, but the 
assortativity decreases drastically. 

Starting from this interpretation, we derive conse¬ 
quences on the structure of biological, social, and tech¬ 
nological networks, by computing (5 worst and <5 avg on a 
dataset made by 93 complex networks. This analysis 
confirms previous results showing that <5 wor st is highly in¬ 
fluenced by “random events”, and it does not capture a 
specific characteristic of the network [3] . Differently from 
previous works, we will also be able to quantify this phe¬ 
nomenon, showing that the distribution of 2<5 "g rBt (where 
D is the diameter of the graph) is approximately nor¬ 
mal. The value of d avg is instead much more robust with 
respect to random events, and it allows us to effectively 
distinguish networks of different kind. Our classification 
will be different from the classification provided by as¬ 
sortativity ED 122]: for instance, a network with few in¬ 
fluential hubs not connected to each other is democratic 
if we consider assortativity, while it is aristocratic in our 
framework. 

Finally, we introduce the hyperbolicity of vertex neigh¬ 
bors. This is done both starting from high-degree ver¬ 
tices and from random vertices. Our goal is to describe 
the “influence area” of such vertices. If a neighbor is not 
democratic, it means that this neighbor is in the influence 
area of the center (this implication is particularly evident 
when the center is a high-degree vertex). If the starting 
vertex v is random, the hyperbolicity of a neighbor grows 
quite irregularly, with local maxima when “more influen¬ 
tial” vertices are reached, and local minima when the 
neighbor size corresponds to the influence area of such 
vertices. On the other hand, if the starting vertex is al¬ 
ready influential, the plot grows almost linearly until the 


hyperbolicity of the whole graph is reached, and then it is 
constant. We have chosen to define the influence area of 
this vertex as the first neighbor where half this threshold 
is reached. 

The size of the influence area of the vertex with high¬ 
est degree provides a good way to classify networks. We 
are able to distinguish “local” networks, where each node 
creates connections to other nodes in order to reach their 
own goals, and “global” networks, where a common goal 
drives the creation of the network. For instance, peer-to- 
peer networks are local, because each node creates con¬ 
nections in order to download data, while metabolic net¬ 
works are global, because the creation of links is driven 
by the global goal of making the cell alive. We show that 
influence areas in a local networks is quite small, con¬ 
taining about vertices, where n is the total number of 
vertices. Conversely, influence areas in global networks 
have a much bigger size, close to | vertices. 

II. METHODS 

The main result of this paper is to show the link be¬ 
tween the hyperbolicity of a graph and its “democracy”. 
This interpretation is motivated by the following propo¬ 
sitions: the first one shows that, if for some vertices v, w, 
max X|3/ 6(x, y 1 v, w ) is not high, then there is a set of small 
diameter that “controls” all approximately shortest paths 
from x to y. Consequently, a network with low hyper¬ 
bolicity is not “democratic”, because shortest paths are 
controlled by small sets. 

Proposition 1. Let v,w be two vertices in a network 
G = (V,E), let B r (v) be the r-neighborhood of v (that 
is, the set {u £ V : d(u,v) < r}), and let B s (w) be 
the s-neighborhood of w. Then, the diameter of the set 
X = B r (v ) Cl B s (w) is at most 2 max Xj2/ S(x, y, v, w) + r + 
s — d(v, w). 

Proof. Let x,y be two vertices in X. Then, 

25(x, y, v, w) >d(x, y) + d(v, w) — ma x(d(x, v) + d{y, w), 
d(x,w ) -t -d(y,v)) > 

>d(x, y) + d(v, w) — (r + s). 

Taking the maximum over all possible x, y £ X, we ob¬ 
tain D(X) = max IjS6 x d(x, y) < 2nra x XtV S(x,y,v,w) + 
r + s — d(v,w), where D(X) is the diameter of the set 
X. □ 

The second proposition is a sort of converse: if there is 
a set of vertices controlling the shortest paths of a given 
4-tuple, the hyperbolicity of that 4-tuple is low. Conse¬ 
quently, if the hyperbolicity is high, then there is not a 
small set of vertices controlling many shortest paths, and 
the network is democratic. 

Proposition 2. Let x,y,v,w be 4-tuple of vertices, and 
let us assume that there exists a set C C V of diameter D 
such that all shortest paths between x, y, v, w pass through 
C. Then, 6(x,y,v,w) < D. 
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Proof. We may assume without loss of generality that 
d(x , y) + d(v, w) > d(x , v) + d(y , w) > d(x, w) + d(y, v). 
Then, if we denote by d(x, C) := min cS c' d(x, c), 

26(x , y, v , -u;) = d(x, j/) + d(i>, w) — d(x, v) — d{y , ui) 

< d(x, C) + D + d(C, y) + d(y, C) + D 
+ d(C,w) - d(x,C) - d{C,v) 

— d(y, C) — d(C, w) 

= 2D. 


□ 


These two results formalize our connection between the 
hyperbolicity constant and how democratic a complex 
network is. This work will confirm this interpretation by 
analyzing a dataset of 93 graphs, made by 19 biological 
networks, 32 social networks, and 42 technological net¬ 
works, and it will draw conclusions on which networks 
are more democratic than others. 

The first data check extends some of the activity al¬ 
ready done SHU]: for each network in our dataset, we 
computed the distribution of 2S y ) '-' , where D is the di¬ 
ameter of the graph (this value is always between 0 and 
i U5|). With respect to the previous papers, we got 
more data referring to larger networks, and we therefore 
deal with lower statistical errors. This is particularly 
important, since it is known that d wors t does not cap¬ 
ture structural properties of a network. Furthermore, its 
behavior is not robust, in the sense that small modifi¬ 
cations in the graph can significantly change the value 
of (5worst m ■ Moreover, computing <5 worst is difficult task, 
since the best “practical” algorithm T2] has running time 
0(n 4 ). We have been able to overcome these issues by 
focusing on i5 avg , already considered in the literature [3], 
but not deeply analyzed. In particular, for each graph 
in the dataset, we have considered the ratio , where 
d avg is the average distance between two randomly cho¬ 
sen vertices fl5l (this parameter takes values in the in¬ 
terval [0,1]). Although the exact computation of 6 avg 
is also hard, the value , avg can be easily approximated 
through sampling. More specifically, if we consider N 4- 
tuples of vertices with hyperbolicity <5i,..., Sn, and since 
a* = 0 < Si < 1 = bi for each i, by the Azuma-Hoeffding 
inequality H3 we obtain: 


EZi Si 


N 


-s, 


avg 


>t < 2e 2 £.-=i^-“« )2 < 


< 2e 2 . 


We choose to sample N = 10,000,000 4-tuples of ver¬ 
tices, obtaining an estimate S avg . The previous inequality 
applied with t = 0.001 yields: 

P (|6av g - <5av g | > 0.001) < 2e -5 < 0.7%. 

This precision is more than sufficient for our purposes. 


Finally, we have analyzed neighbors of a given vertex 
v, approximating their ratio ~f avg as before, in order to 
analyze the influence area of v. The size of the neighbor 
considered ranges from the degree of v to n, with steps 
of 10 vertices (where n is the total number of vertices in 
the graph). We have chosen v as the maximum degree 
vertex, which intuitively should have a big influence area, 
and we have used as a benchmark of comparison the same 
results from a random vertex. 

We have defined the “influence area” of v as the biggest 
neighbor where 2 f avg is at most half than the same value 
in the whole graph. However, in order to avoid “ran¬ 
dom deviations” (especially, when the neighbor is small), 
in our experiments we have considered the fourth neigh¬ 
bor where this event has occurred. The purpose of this 
analysis is twofold: not only we define and compute the 
influence area of a vertex, but we also classify networks 
according to the size of this influence area. 


III. RESULTS 
A. Worst-Case Hyperbolicity. 

The most common approach is to consider the max¬ 
imum hyperbolicity of a 4-tuple of vertices, that is, 
<5worst (G). Despite some attempts in proving that real- 
world networks usually have low hyperbolicity [18] , it 
soon became clear that the small values obtained are con¬ 
sequences of small world networks [3], since 0 < 25 worst < 
D (where D is the diameter of the graph). In particular, 
in a dataset of small social and biological networks [3] 
there is no relation between 2(5 wor st and D. The ratio 
between these values varies between 25% and 89%. 

In this paper, we make a more detailed analysis, work¬ 
ing with larger networks, between one hundred and sev¬ 
eral thousand of vertices. The results obtained are shown 
in Figure [l] which also contains specific results dealing 
only with social, biological and technological networks. 

These results show that the distribution of the ra¬ 
tio 2 ^ rat is approximately Gaussian, both in the whole 
dataset and in each single kind of network. The aver¬ 
age ratio is 0.521, and the standard deviation is 0.085. 
Moreover, a Chi-square goodness of fit test applied to the 
previous data does not reject the hypothesis that the dis¬ 
tribution is Gaussian with mean 0.5 and variance 0.085, 
with a very high confidence level [23]. This result con¬ 
firms that the hyperbolicity of real-world networks is not 
much “smaller than expected”, result already obtained 
in the past [3]. However, we are able to perform a fur¬ 
ther step: the Gaussian probability distribution makes us 
think that <5 wor st is influenced by random events. Indeed 
it does not reflect particular characteristics of the net¬ 
work, since the same distribution arises from networks of 
different kinds. 

Social networks show a slightly different behavior, 
since many of them have a larger value of 2<5 "g rat , be- 
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FIG. 1. The distribution of 2l5 '*g rat in the graphs in our dataset. The bar corresponding to the value p contains all networks 
where p — 0.5 < 2S ™ rBt < p + 0.5. 


tween 0.65 and 0.75. However, this is due to the pres¬ 
ence of several financial (e-MID, a platform for interbank 
lending) networks, where the ratio is often | or | since 
the diameter is 3 or 4. 

Despite this particular case, we may conclude that 
^worst is not a characteristic of the network, but it mainly 
depends on “random events” that have a deep impact on 
the value of 5 wors t- This conclusion is further confirmed 
by the particular case of the e-MID networks: this pa¬ 
rameter changed from 0.750 in 2011 to 0.286 in 2012, only 
because a simple path of length 3 increased the diameter 
from 4 to 7. 


B. Average Hyperbolicity. 

In the past, the average hyperbolicity d avg of a 4- 
tuple of vertices was rarely analyzed: the only known re¬ 
sult is that it is usually significantly smaller than <5 wor st 


[3]. However, we think that this parameter may pro¬ 
vide very interesting results, because it is robust, in the 
sense that it does not change much if few edges of the 
graph are modified. Furthermore, it is easily approx- 
imable through sampling (while computing d worst takes 
time 0(n 4 )). Similarly to what we have done in the anal¬ 
ysis of dworst, we have considered the ratio , where 
d aV g denotes the average distance in the network (also 
this parameter lies in the interval [0,1]). The results ob¬ 
tained are plotted in Figure [2j 

The picture shows that the average hyperbolicity is 
usually an order of magnitude smaller than the aver¬ 
age distance: in this sense, real-world networks are in¬ 
deed hyperbolic. Moreover, it is possible to make a dis¬ 
tinction between networks that are “more democratic”, 
like the e-MID networks or the peer-to-peer networks 
(where the hyperbolicity is large), and networks that are 
“more centralized”, like some social networks and most 
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FIG. 2. The value 


of all the networks in our dataset 
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autonomous systems networks. 

C. Hyperbolicity of Neighbors. 

Since the hyperbolicity of a graph is closely related to 
the existence of a small part of the graph controlling most 
shortest paths, we have analyzed which subgraphs of a 
given graph have small hyperbolicity. Intuitively, these 
subgraphs should be “less democratic” than the whole 
graph, in the sense that they are contained in the “influ¬ 
ence area” of a small group of vertices. In this analysis, 
we have tried to spot the influence area of a single vertex, 
by measuring 2 ^ avg on neighbors of v in increasing order 
of size. In order to prove the effectiveness of this ap¬ 
proach, we have first tested a synthetic power-law graph 
[20] made by three communities of 1000 vertices each 
(see the lowest plot in Figure [3]). We have computed the 
hyperbolicity of neighbors of the vertex v with highest 
degree: we can see a local minimum close to the size 
of a community. In our opinion, this minimum appears 
because the neighbor is “dominated by the community”, 
and consequently by the center v of the community. This 
result confirms the link between the value of 2 f avg and the 
influence area of a vertex. 

Finally, we passed to the analysis of neighbors in real- 
world networks. The upper plots in Figure [3] show the 
same results for one network of each kind: 

• a social network, the General Relativity and Quan¬ 
tum Cosmology collaboration network; 

• a biological network, the yeast metabolic network; 

• a technological network, the peer-to-peer Gnutella 
network in 2004. 

As a benchmark of comparison, we have also considered 
the hyperbolicity of neighbors of a random vertex. 

The plots show that the hyperbolicity of a neighbor of 
the maximum degree vertex grows almost linearly with 
the neighbor size, until it converges to the hyperbolic- 
ity of the whole graph. Convergence time differs from 
graph to graph. In biological networks, convergence was 
reached at size close to j, while in the social and in 
the technological networks convergence is reached before. 
For neighbors of a random vertex, we outline a different 
behavior: at the beginning, the growth is not monotone, 
like in the previous case, and it is much more irregular. 
In our opinion, this is due to the fact that, when the 
neighbor grows, it reaches more and more “influential” 
vertices, and the first neighbor that touches such a ver¬ 
tex corresponds to a local maximum in the plot. After 
some steps, the hyperbolicity grows more and more reg¬ 
ularly, because we have reached a very influential vertex 
w, and from that point on we are mainly considering the 
influence area of w, not of v. This issue is further con¬ 
firmed by Figure [4] where the derivative of the average 
hyperbolicity is shown. Hence, this experiment outlines 


two significant point: on the one hand, it allows us to 
define the influence area of v as the neighbor where the 
first local maximum is reached; on the other hand, it 
gives motivations to the analysis of neighbors of influ¬ 
ential vertices, because their influence area has a strong 
impact on the topology of the whole network. 

For this reason, we have focused on the maximum de¬ 
gree vertex, and, in order to have more general results, we 
have analyzed all graphs in the dataset. Figure [5] shows 
the size of the maximum neighbor having ratio , avg at 
least half than the same ratio in the whole graph. Ac¬ 
tually, in order to exclude random deviations from our 
analysis, we have plotted the fourth neighbor where the 
condition is satisfied. The figure shows that the influence 
area of an individual is low in social and peer-to-peer 
networks, compared to biological or autonomous system 
network. 

This standard behavior has few exceptions: first 
of all, protein-protein interaction networks (string, 
ecoli. interaction) are different from other biological 
networks, and the influence area is smaller. Further¬ 
more, the social network GoogleNW contains a vertex with 
an enormous influence area: this network is the set of 
Google pages, and the central vertex v considered is the 
page www.google.com, which clearly dominates all the 
others. Another particular case is the social network 
f acebook_combined: this network is a collection of ego- 
networks from Facebook, and links are made if common 
interests are retrieved. We think that this network is dif¬ 
ferent from the others because it is a small subgraph of a 
bigger graph (where all Facebook users are considered), 
and the choice of the subgraph has a strong impact on 
the topology of the network, which does not reflect the 
standard behavior. 


IV. DISCUSSION 

In the literature, several works have analyzed the hy- 
perbolicity of a complex network. They used this quan¬ 
tity in order to classify real-world networks and in order 
to draw conclusions about the impact of hyperbolicity on 
the network topology. However, these works are mainly 
based on the analysis of d W orst, which has two drawbacks: 
it is not robust , that is, small modifications on the net¬ 
work can have deep impacts on its value, and it is not 
scalable , that is, it can be exactly computed only on small 
networks. In this work, we confirmed these conclusions, 
and we have proposed a different approach: using <5 avg 
instead of (5 worst , a parameter already considered in the 
literature. We interpreted this parameter as a measure 
of “democracy” in a network, and we classified different 
networks according to how democratic they are. 

In particular, we have shown that technological au¬ 
tonomous system networks are less “democratic” than 
social or biological networks, in agreement with our intu¬ 
ition (since AS graphs have a “built-in” hierarchy, while 
in social networks everyone has the same role). More- 
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FIG. 3. The value of d avs for neighbours of a randomly chosen vertex (up left), or the maximum degree vertex (up right); S n 
is the number of vertices in the neighbours, while S is the total number of vertices. Results are shown for a social network, a 
biological network, a technological network, and (below) a synthetic network. 


over, we have applied this concept to neighbors of in¬ 
fluential nodes: we have seen that larger neighbors are 
more democratic, and the relationship between the size 
and the hyperbolicity is almost linear, until a threshold is 
reached. This analysis clearly outlines the influence area 
of a node, whose size strongly depends on the graph con¬ 
sidered. We have shown that nodes have a rather small 
influence area in social and peer-to-peer networks, while 
in autonomous systems and biological networks the in¬ 
fluence area can be close to half the graph. A possible 
explanation of this behavior is that the former networks 
are “distributed”, in the sense that each node has a goal 
(downloading in peer-to-peer networks, and creating re¬ 
lationships in social networks), and edges are created lo¬ 
cally by nodes that try to reach the goal. On the other 
hand, the latter networks have global goals (connecting 
everyone in the network, or making a cell live), and the 
creation of edges is “centralized”. Our analysis is able to 


distinguish graphs of these two kinds. 

These results prove that democracy in a complex net¬ 
work is well formalized by our definition of hyperbolicity, 
since the consequences of our interpretation are coher¬ 
ent with intuitive ideas. Furthermore, we have provided 
an application of this interpretation, making it possible 
to define and analyze the influence area of a node. As 
far as we know, this is the first work that provides this 
interpretation of the average hyperbolicity of a graph. 

Possible applications include not only the classification 
of networks according to this parameter, but also the 
classifications of nodes in a network, or the classification 
of different communities. These communities might be 
democratic, if everyone has “the same role” and the hy¬ 
perbolicity is high, or not democratic, if there is a group 
of few nodes that keeps the community together, making 
the average hyperbolicity small. Future researches may 
address this characterization, and they may also clas- 







































sify nodes in a network according to their influence area, 
in order to distinguish important hubs from peripheral 
node. Finally, it would be interesting to compare this 
approach with other approaches to determine the central¬ 
ity of a node (closeness centrality, betweenness centrality, 
eigenvector centrality, and so on), in order to better out¬ 
line the features of this analysis. 


FIG. 4. The derivative with respect to the neighbors propor¬ 
tion S„/S of the value of 2 f avs , in neighbors of the maximum 

' “avg 

degree vertex and of random vertices. 
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FIG. 5. The size of the fourth neighbour of the maximum degree vertex having , avs at least half than the same value in the 
whole graph. 



